Overview
Brought to you by YData
Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 357234 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 265.4 MiB |
| Average record size in memory | 779.1 B |
Variable types
| Numeric | 8 |
|---|---|
| Text | 4 |
| DateTime | 1 |
| Categorical | 7 |
Anno is highly overall correlated with CRASH_UNIT_ID and 1 other fields | High correlation |
CRASH_UNIT_ID is highly overall correlated with Anno and 1 other fields | High correlation |
MANEUVER is highly overall correlated with UNIT_TYPE | High correlation |
UNIT_TYPE is highly overall correlated with MANEUVER | High correlation |
VEHICLE_ID is highly overall correlated with Anno and 1 other fields | High correlation |
UNIT_TYPE is highly imbalanced (94.8%) | Imbalance |
VEHICLE_DEFECT is highly imbalanced (76.0%) | Imbalance |
VEHICLE_TYPE is highly imbalanced (58.8%) | Imbalance |
VEHICLE_USE is highly imbalanced (67.7%) | Imbalance |
VEHICLE_YEAR is highly skewed (γ1 = 42.64083) | Skewed |
CRASH_UNIT_ID has unique values | Unique |
VEHICLE_ID has unique values | Unique |
Reproduction
| Analysis started | 2024-11-05 17:05:25.549537 |
|---|---|
| Analysis finished | 2024-11-05 17:05:51.690947 |
| Duration | 26.14 seconds |
| Software version | ydata-profiling vv4.12.0 |
| Download configuration | config.json |
Variables
CRASH_UNIT_ID
Real number (ℝ)
High correlation  Unique 
| Distinct | 357234 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 276828.55 |
| Minimum | 2 |
|---|---|
| Maximum | 561564 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 27192.65 |
| Q1 | 136450.25 |
| median | 274486.5 |
| Q3 | 416253.75 |
| 95-th percentile | 532521.35 |
| Maximum | 561564 |
| Range | 561562 |
| Interquartile range (IQR) | 279803.5 |
Descriptive statistics
| Standard deviation | 162004.35 |
|---|---|
| Coefficient of variation (CV) | 0.58521549 |
| Kurtosis | -1.199653 |
| Mean | 276828.55 |
| Median Absolute Deviation (MAD) | 139857 |
| Skewness | 0.034817138 |
| Sum | 9.889257 × 1010 |
| Variance | 2.6245411 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 481322 | 1 | < 0.1% |
| 561563 | 1 | < 0.1% |
| 40727 | 1 | < 0.1% |
| 394635 | 1 | < 0.1% |
| 394634 | 1 | < 0.1% |
| 7739 | 1 | < 0.1% |
| 4918 | 1 | < 0.1% |
| 4917 | 1 | < 0.1% |
| 86 | 1 | < 0.1% |
| 58 | 1 | < 0.1% |
| Other values (357224) | 357224 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 | |
| 7 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 14 | 1 | |
| 15 | 1 |
| Value | Count | Frequency (%) |
| 561564 | 1 | |
| 561563 | 1 | |
| 561547 | 1 | |
| 561546 | 1 | |
| 561542 | 1 | |
| 561541 | 1 | |
| 561540 | 1 | |
| 561532 | 1 | |
| 561529 | 1 | |
| 561528 | 1 |
RD_NO
Text
| Distinct | 215177 |
|---|---|
| Distinct (%) | 60.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.9 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 79079 ? |
|---|---|
| Unique (%) | 22.1% |
Sample
| 1st row | JC113627 |
|---|---|
| 2nd row | JC113627 |
| 3rd row | JC113637 |
| 4th row | JC113637 |
| 5th row | JC113630 |
| Value | Count | Frequency (%) |
| ja522872 | 9 | < 0.1% |
| jb571997 | 8 | < 0.1% |
| jb174902 | 8 | < 0.1% |
| jb210669 | 7 | < 0.1% |
| ja364229 | 7 | < 0.1% |
| ja518406 | 6 | < 0.1% |
| hz306708 | 6 | < 0.1% |
| jb398994 | 6 | < 0.1% |
| jb249939 | 6 | < 0.1% |
| jb557525 | 6 | < 0.1% |
| Other values (215167) | 357165 |
Most occurring characters
| Value | Count | Frequency (%) |
| J | 283307 | |
| 4 | 269493 | |
| 5 | 250026 | |
| 1 | 249051 | |
| 3 | 246175 | |
| 2 | 244561 | |
| 0 | 183448 | 6.4% |
| 6 | 180385 | 6.3% |
| 7 | 174482 | 6.1% |
| 9 | 172916 | 6.1% |
| Other values (23) | 604028 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2857872 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| J | 283307 | |
| 4 | 269493 | |
| 5 | 250026 | |
| 1 | 249051 | |
| 3 | 246175 | |
| 2 | 244561 | |
| 0 | 183448 | 6.4% |
| 6 | 180385 | 6.3% |
| 7 | 174482 | 6.1% |
| 9 | 172916 | 6.1% |
| Other values (23) | 604028 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2857872 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| J | 283307 | |
| 4 | 269493 | |
| 5 | 250026 | |
| 1 | 249051 | |
| 3 | 246175 | |
| 2 | 244561 | |
| 0 | 183448 | 6.4% |
| 6 | 180385 | 6.3% |
| 7 | 174482 | 6.1% |
| 9 | 172916 | 6.1% |
| Other values (23) | 604028 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2857872 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| J | 283307 | |
| 4 | 269493 | |
| 5 | 250026 | |
| 1 | 249051 | |
| 3 | 246175 | |
| 2 | 244561 | |
| 0 | 183448 | 6.4% |
| 6 | 180385 | 6.3% |
| 7 | 174482 | 6.1% |
| 9 | 172916 | 6.1% |
| Other values (23) | 604028 |
CRASH_DATE
Date
| Distinct | 149378 |
|---|---|
| Distinct (%) | 41.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
| Minimum | 2014-01-18 18:14:00 |
|---|---|
| Maximum | 2019-01-11 23:36:00 |
UNIT_NO
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5158859 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.55405008 |
|---|---|
| Coefficient of variation (CV) | 0.36549589 |
| Kurtosis | 0.98450568 |
| Mean | 1.5158859 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.63476065 |
| Sum | 541526 |
| Variance | 0.30697149 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 181687 | |
| 2 | 167949 | |
| 3 | 6681 | 1.9% |
| 4 | 751 | 0.2% |
| 5 | 126 | < 0.1% |
| 6 | 25 | < 0.1% |
| 7 | 8 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 181687 | |
| 2 | 167949 | |
| 3 | 6681 | 1.9% |
| 4 | 751 | 0.2% |
| 5 | 126 | < 0.1% |
| 6 | 25 | < 0.1% |
| 7 | 8 | < 0.1% |
| 8 | 5 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 8 | < 0.1% |
| 6 | 25 | < 0.1% |
| 5 | 126 | < 0.1% |
| 4 | 751 | 0.2% |
| 3 | 6681 | 1.9% |
| 2 | 167949 | |
| 1 | 181687 |
UNIT_TYPE
Categorical
High correlation  Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.2 MiB |
| DRIVER | |
|---|---|
| PARKED | 4461 |
| DRIVERLESS | 181 |
| NON-CONTACT VEHICLE | 21 |
Length
| Max length | 19 |
|---|---|
| Median length | 6 |
| Mean length | 6.0027909 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DRIVER |
|---|---|
| 2nd row | DRIVER |
| 3rd row | DRIVER |
| 4th row | DRIVER |
| 5th row | DRIVER |
Common Values
| Value | Count | Frequency (%) |
| DRIVER | 352571 | |
| PARKED | 4461 | 1.2% |
| DRIVERLESS | 181 | 0.1% |
| NON-CONTACT VEHICLE | 21 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| driver | 352571 | |
| parked | 4461 | 1.2% |
| driverless | 181 | 0.1% |
| non-contact | 21 | < 0.1% |
| vehicle | 21 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 709965 | |
| E | 357436 | |
| D | 357213 | |
| I | 352773 | |
| V | 352773 | |
| A | 4482 | 0.2% |
| P | 4461 | 0.2% |
| K | 4461 | 0.2% |
| S | 362 | < 0.1% |
| L | 202 | < 0.1% |
| Other values (7) | 273 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2144401 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 709965 | |
| E | 357436 | |
| D | 357213 | |
| I | 352773 | |
| V | 352773 | |
| A | 4482 | 0.2% |
| P | 4461 | 0.2% |
| K | 4461 | 0.2% |
| S | 362 | < 0.1% |
| L | 202 | < 0.1% |
| Other values (7) | 273 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2144401 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 709965 | |
| E | 357436 | |
| D | 357213 | |
| I | 352773 | |
| V | 352773 | |
| A | 4482 | 0.2% |
| P | 4461 | 0.2% |
| K | 4461 | 0.2% |
| S | 362 | < 0.1% |
| L | 202 | < 0.1% |
| Other values (7) | 273 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2144401 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 709965 | |
| E | 357436 | |
| D | 357213 | |
| I | 352773 | |
| V | 352773 | |
| A | 4482 | 0.2% |
| P | 4461 | 0.2% |
| K | 4461 | 0.2% |
| S | 362 | < 0.1% |
| L | 202 | < 0.1% |
| Other values (7) | 273 | < 0.1% |
VEHICLE_ID
Real number (ℝ)
High correlation  Unique 
| Distinct | 357234 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 267320.61 |
| Minimum | 2 |
|---|---|
| Maximum | 535741 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 26253.3 |
| Q1 | 136218.25 |
| median | 266644.5 |
| Q3 | 399975.75 |
| 95-th percentile | 508502.35 |
| Maximum | 535741 |
| Range | 535739 |
| Interquartile range (IQR) | 263757.5 |
Descriptive statistics
| Standard deviation | 154190.82 |
|---|---|
| Coefficient of variation (CV) | 0.57680107 |
| Kurtosis | -1.1870718 |
| Mean | 267320.61 |
| Median Absolute Deviation (MAD) | 131893 |
| Skewness | 0.0020802487 |
| Sum | 9.5496012 × 1010 |
| Variance | 2.3774808 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 460661 | 1 | < 0.1% |
| 535738 | 1 | < 0.1% |
| 39360 | 1 | < 0.1% |
| 379612 | 1 | < 0.1% |
| 379611 | 1 | < 0.1% |
| 7376 | 1 | < 0.1% |
| 4682 | 1 | < 0.1% |
| 4677 | 1 | < 0.1% |
| 83 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
| Other values (357224) | 357224 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 | |
| 7 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 14 | 1 | |
| 15 | 1 |
| Value | Count | Frequency (%) |
| 535741 | 1 | |
| 535738 | 1 | |
| 535725 | 1 | |
| 535723 | 1 | |
| 535718 | 1 | |
| 535717 | 1 | |
| 535714 | 1 | |
| 535710 | 1 | |
| 535709 | 1 | |
| 535706 | 1 |
MAKE
Text
| Distinct | 578 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.6 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 53 |
| Mean length | 10.015421 |
| Min length | 2 |
Unique
| Unique | 241 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | TOYOTA MOTOR COMPANY, LTD. |
|---|---|
| 2nd row | FORD |
| 3rd row | CHEVROLET |
| 4th row | JEEP |
| 5th row | JEEP |
| Value | Count | Frequency (%) |
| motor | 49014 | 8.8% |
| ltd | 47247 | 8.5% |
| company | 47148 | 8.5% |
| toyota | 47137 | 8.5% |
| chevrolet | 45230 | 8.1% |
| ford | 39557 | 7.1% |
| nissan | 31844 | 5.7% |
| honda | 29325 | 5.3% |
| corp | 17965 | 3.2% |
| dodge | 17705 | 3.2% |
| Other values (915) | 185104 |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 463297 | 12.9% |
| T | 286728 | 8.0% |
| A | 256850 | 7.2% |
| N | 234418 | 6.6% |
| R | 233021 | 6.5% |
| E | 218213 | 6.1% |
| 200042 | 5.6% | |
| D | 191875 | 5.4% |
| C | 172025 | 4.8% |
| L | 154434 | 4.3% |
| Other values (32) | 1166946 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3577849 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| O | 463297 | 12.9% |
| T | 286728 | 8.0% |
| A | 256850 | 7.2% |
| N | 234418 | 6.6% |
| R | 233021 | 6.5% |
| E | 218213 | 6.1% |
| 200042 | 5.6% | |
| D | 191875 | 5.4% |
| C | 172025 | 4.8% |
| L | 154434 | 4.3% |
| Other values (32) | 1166946 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3577849 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| O | 463297 | 12.9% |
| T | 286728 | 8.0% |
| A | 256850 | 7.2% |
| N | 234418 | 6.6% |
| R | 233021 | 6.5% |
| E | 218213 | 6.1% |
| 200042 | 5.6% | |
| D | 191875 | 5.4% |
| C | 172025 | 4.8% |
| L | 154434 | 4.3% |
| Other values (32) | 1166946 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3577849 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| O | 463297 | 12.9% |
| T | 286728 | 8.0% |
| A | 256850 | 7.2% |
| N | 234418 | 6.6% |
| R | 233021 | 6.5% |
| E | 218213 | 6.1% |
| 200042 | 5.6% | |
| D | 191875 | 5.4% |
| C | 172025 | 4.8% |
| L | 154434 | 4.3% |
| Other values (32) | 1166946 |
MODEL
Text
| Distinct | 1477 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.2 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 56 |
| Mean length | 8.9797864 |
| Min length | 2 |
Unique
| Unique | 363 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Highlander(beginning vehicle year 2001) |
|---|---|
| 2nd row | EXPLORER |
| 3rd row | MALIBU (CHEVELLE) |
| 4th row | LAREDO |
| 5th row | Liberty |
| Value | Count | Frequency (%) |
| unknown | 52231 | 10.2% |
| nissan | 17263 | 3.4% |
| camry | 15186 | 3.0% |
| altima | 9087 | 1.8% |
| corolla | 8374 | 1.6% |
| civic | 7656 | 1.5% |
| accord | 7298 | 1.4% |
| sport | 7232 | 1.4% |
| chevelle | 7182 | 1.4% |
| malibu | 7178 | 1.4% |
| Other values (1771) | 373721 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 282577 | 8.8% |
| A | 247338 | 7.7% |
| E | 179344 | 5.6% |
| R | 175207 | 5.5% |
| O | 164779 | 5.1% |
| 155174 | 4.8% | |
| C | 134495 | 4.2% |
| U | 122341 | 3.8% |
| S | 120315 | 3.8% |
| I | 106935 | 3.3% |
| Other values (63) | 1519380 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3207885 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 282577 | 8.8% |
| A | 247338 | 7.7% |
| E | 179344 | 5.6% |
| R | 175207 | 5.5% |
| O | 164779 | 5.1% |
| 155174 | 4.8% | |
| C | 134495 | 4.2% |
| U | 122341 | 3.8% |
| S | 120315 | 3.8% |
| I | 106935 | 3.3% |
| Other values (63) | 1519380 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3207885 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 282577 | 8.8% |
| A | 247338 | 7.7% |
| E | 179344 | 5.6% |
| R | 175207 | 5.5% |
| O | 164779 | 5.1% |
| 155174 | 4.8% | |
| C | 134495 | 4.2% |
| U | 122341 | 3.8% |
| S | 120315 | 3.8% |
| I | 106935 | 3.3% |
| Other values (63) | 1519380 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3207885 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 282577 | 8.8% |
| A | 247338 | 7.7% |
| E | 179344 | 5.6% |
| R | 175207 | 5.5% |
| O | 164779 | 5.1% |
| 155174 | 4.8% | |
| C | 134495 | 4.2% |
| U | 122341 | 3.8% |
| S | 120315 | 3.8% |
| I | 106935 | 3.3% |
| Other values (63) | 1519380 |
LIC_PLATE_STATE
Text
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.8 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IL |
|---|---|
| 2nd row | IL |
| 3rd row | IL |
| 4th row | IL |
| 5th row | IL |
| Value | Count | Frequency (%) |
| il | 336201 | |
| in | 6446 | 1.8% |
| wi | 2132 | 0.6% |
| mi | 1643 | 0.5% |
| xx | 959 | 0.3% |
| oh | 888 | 0.2% |
| tx | 835 | 0.2% |
| fl | 809 | 0.2% |
| az | 743 | 0.2% |
| ia | 689 | 0.2% |
| Other values (42) | 5889 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 347166 | |
| L | 337231 | |
| N | 8328 | 1.2% |
| M | 3272 | 0.5% |
| A | 3261 | 0.5% |
| X | 2753 | 0.4% |
| W | 2284 | 0.3% |
| O | 1980 | 0.3% |
| T | 1429 | 0.2% |
| C | 953 | 0.1% |
| Other values (15) | 5811 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 714468 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| I | 347166 | |
| L | 337231 | |
| N | 8328 | 1.2% |
| M | 3272 | 0.5% |
| A | 3261 | 0.5% |
| X | 2753 | 0.4% |
| W | 2284 | 0.3% |
| O | 1980 | 0.3% |
| T | 1429 | 0.2% |
| C | 953 | 0.1% |
| Other values (15) | 5811 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 714468 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| I | 347166 | |
| L | 337231 | |
| N | 8328 | 1.2% |
| M | 3272 | 0.5% |
| A | 3261 | 0.5% |
| X | 2753 | 0.4% |
| W | 2284 | 0.3% |
| O | 1980 | 0.3% |
| T | 1429 | 0.2% |
| C | 953 | 0.1% |
| Other values (15) | 5811 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 714468 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| I | 347166 | |
| L | 337231 | |
| N | 8328 | 1.2% |
| M | 3272 | 0.5% |
| A | 3261 | 0.5% |
| X | 2753 | 0.4% |
| W | 2284 | 0.3% |
| O | 1980 | 0.3% |
| T | 1429 | 0.2% |
| C | 953 | 0.1% |
| Other values (15) | 5811 | 0.8% |
VEHICLE_YEAR
Real number (ℝ)
Skewed 
| Distinct | 130 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2013.8658 |
| Minimum | 1900 |
|---|---|
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1999 |
| Q1 | 2005 |
| median | 2011 |
| Q3 | 2014 |
| 95-th percentile | 2017 |
| Maximum | 9999 |
| Range | 8099 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 185.95222 |
|---|---|
| Coefficient of variation (CV) | 0.092335952 |
| Kurtosis | 1825.8722 |
| Mean | 2013.8658 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 42.64083 |
| Sum | 7.1942134 × 108 |
| Variance | 34578.227 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2015 | 29738 | 8.3% |
| 2014 | 27532 | 7.7% |
| 2016 | 26667 | 7.5% |
| 2013 | 25284 | 7.1% |
| 2012 | 21984 | 6.2% |
| 2017 | 20675 | 5.8% |
| 2007 | 19094 | 5.3% |
| 2011 | 18030 | 5.0% |
| 2008 | 17931 | 5.0% |
| 2006 | 17463 | 4.9% |
| Other values (120) | 132836 |
| Value | Count | Frequency (%) |
| 1900 | 118 | |
| 1901 | 8 | < 0.1% |
| 1905 | 1 | < 0.1% |
| 1911 | 1 | < 0.1% |
| 1941 | 1 | < 0.1% |
| 1951 | 1 | < 0.1% |
| 1952 | 2 | < 0.1% |
| 1960 | 2 | < 0.1% |
| 1961 | 1 | < 0.1% |
| 1962 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999 | 192 | |
| 6043 | 1 | < 0.1% |
| 5015 | 1 | < 0.1% |
| 5012 | 1 | < 0.1% |
| 5007 | 1 | < 0.1% |
| 3023 | 1 | < 0.1% |
| 3017 | 1 | < 0.1% |
| 3016 | 1 | < 0.1% |
| 3013 | 7 | < 0.1% |
| 3012 | 1 | < 0.1% |
VEHICLE_DEFECT
Categorical
Imbalance 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.8 MiB |
| NONE | |
|---|---|
| UNKNOWN | |
| OTHER | 1406 |
| BRAKES | 1373 |
| TIRES | 202 |
| Other values (12) | 505 |
Length
| Max length | 16 |
|---|---|
| Median length | 4 |
| Mean length | 4.9284503 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NONE |
|---|---|
| 2nd row | NONE |
| 3rd row | NONE |
| 4th row | NONE |
| 5th row | NONE |
Common Values
| Value | Count | Frequency (%) |
| NONE | 245361 | |
| UNKNOWN | 108387 | |
| OTHER | 1406 | 0.4% |
| BRAKES | 1373 | 0.4% |
| TIRES | 202 | 0.1% |
| STEERING | 189 | 0.1% |
| WHEELS | 104 | < 0.1% |
| SUSPENSION | 58 | < 0.1% |
| ENGINE/MOTOR | 42 | < 0.1% |
| FUEL SYSTEM | 29 | < 0.1% |
| Other values (7) | 83 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| none | 245361 | |
| unknown | 108387 | |
| other | 1406 | 0.4% |
| brakes | 1373 | 0.4% |
| tires | 202 | 0.1% |
| steering | 189 | 0.1% |
| wheels | 104 | < 0.1% |
| suspension | 58 | < 0.1% |
| engine/motor | 42 | < 0.1% |
| system | 38 | < 0.1% |
| Other values (9) | 115 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 816312 | |
| O | 355333 | |
| E | 249154 | 14.2% |
| K | 109760 | 6.2% |
| W | 108537 | 6.2% |
| U | 108482 | 6.2% |
| R | 3247 | 0.2% |
| S | 2192 | 0.1% |
| T | 1930 | 0.1% |
| H | 1542 | 0.1% |
| Other values (14) | 4121 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1760610 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 816312 | |
| O | 355333 | |
| E | 249154 | 14.2% |
| K | 109760 | 6.2% |
| W | 108537 | 6.2% |
| U | 108482 | 6.2% |
| R | 3247 | 0.2% |
| S | 2192 | 0.1% |
| T | 1930 | 0.1% |
| H | 1542 | 0.1% |
| Other values (14) | 4121 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1760610 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 816312 | |
| O | 355333 | |
| E | 249154 | 14.2% |
| K | 109760 | 6.2% |
| W | 108537 | 6.2% |
| U | 108482 | 6.2% |
| R | 3247 | 0.2% |
| S | 2192 | 0.1% |
| T | 1930 | 0.1% |
| H | 1542 | 0.1% |
| Other values (14) | 4121 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1760610 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 816312 | |
| O | 355333 | |
| E | 249154 | 14.2% |
| K | 109760 | 6.2% |
| W | 108537 | 6.2% |
| U | 108482 | 6.2% |
| R | 3247 | 0.2% |
| S | 2192 | 0.1% |
| T | 1930 | 0.1% |
| H | 1542 | 0.1% |
| Other values (14) | 4121 | 0.2% |
VEHICLE_TYPE
Categorical
Imbalance 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.3 MiB |
| PASSENGER | |
|---|---|
| SPORT UTILITY VEHICLE (SUV) | |
| VAN/MINI-VAN | 20032 |
| PICKUP | 10144 |
| UNKNOWN/NA | 10043 |
| Other values (12) | 21448 |
Length
| Max length | 27 |
|---|---|
| Median length | 9 |
| Mean length | 12.082845 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SPORT UTILITY VEHICLE (SUV) |
|---|---|
| 2nd row | SPORT UTILITY VEHICLE (SUV) |
| 3rd row | PASSENGER |
| 4th row | PASSENGER |
| 5th row | PASSENGER |
Common Values
| Value | Count | Frequency (%) |
| PASSENGER | 246201 | |
| SPORT UTILITY VEHICLE (SUV) | 49366 | 13.8% |
| VAN/MINI-VAN | 20032 | 5.6% |
| PICKUP | 10144 | 2.8% |
| UNKNOWN/NA | 10043 | 2.8% |
| TRUCK - SINGLE UNIT | 7007 | 2.0% |
| BUS OVER 15 PASS. | 4400 | 1.2% |
| OTHER | 3791 | 1.1% |
| TRACTOR W/ SEMI-TRAILER | 3432 | 1.0% |
| BUS UP TO 15 PASS. | 813 | 0.2% |
| Other values (7) | 2005 | 0.6% |
Length
| Value | Count | Frequency (%) |
| passenger | 246201 | |
| vehicle | 49926 | 9.0% |
| sport | 49366 | 8.9% |
| utility | 49366 | 8.9% |
| suv | 49366 | 8.9% |
| van/mini-van | 20032 | 3.6% |
| pickup | 10144 | 1.8% |
| unknown/na | 10043 | 1.8% |
| truck | 7007 | 1.3% |
| 7007 | 1.3% | |
| Other values (25) | 55512 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 618185 | |
| S | 617677 | |
| N | 360683 | 8.4% |
| R | 329724 | 7.6% |
| P | 321885 | 7.5% |
| A | 310037 | 7.2% |
| G | 253208 | 5.9% |
| I | 221960 | 5.1% |
| 196736 | 4.6% | |
| T | 181050 | 4.2% |
| Other values (22) | 905258 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4316403 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 618185 | |
| S | 617677 | |
| N | 360683 | 8.4% |
| R | 329724 | 7.6% |
| P | 321885 | 7.5% |
| A | 310037 | 7.2% |
| G | 253208 | 5.9% |
| I | 221960 | 5.1% |
| 196736 | 4.6% | |
| T | 181050 | 4.2% |
| Other values (22) | 905258 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4316403 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 618185 | |
| S | 617677 | |
| N | 360683 | 8.4% |
| R | 329724 | 7.6% |
| P | 321885 | 7.5% |
| A | 310037 | 7.2% |
| G | 253208 | 5.9% |
| I | 221960 | 5.1% |
| 196736 | 4.6% | |
| T | 181050 | 4.2% |
| Other values (22) | 905258 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4316403 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 618185 | |
| S | 617677 | |
| N | 360683 | 8.4% |
| R | 329724 | 7.6% |
| P | 321885 | 7.5% |
| A | 310037 | 7.2% |
| G | 253208 | 5.9% |
| I | 221960 | 5.1% |
| 196736 | 4.6% | |
| T | 181050 | 4.2% |
| Other values (22) | 905258 |
VEHICLE_USE
Categorical
Imbalance 
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.1 MiB |
| PERSONAL | |
|---|---|
| UNKNOWN/NA | |
| OTHER | 11072 |
| TAXI/FOR HIRE | 10490 |
| COMMERCIAL - SINGLE UNIT | 5152 |
| Other values (20) | 21337 |
Length
| Max length | 28 |
|---|---|
| Median length | 8 |
| Mean length | 8.8046854 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PERSONAL |
|---|---|
| 2nd row | PERSONAL |
| 3rd row | PERSONAL |
| 4th row | PERSONAL |
| 5th row | PERSONAL |
Common Values
| Value | Count | Frequency (%) |
| PERSONAL | 269825 | |
| UNKNOWN/NA | 39358 | 11.0% |
| OTHER | 11072 | 3.1% |
| TAXI/FOR HIRE | 10490 | 2.9% |
| COMMERCIAL - SINGLE UNIT | 5152 | 1.4% |
| RIDESHARE SERVICE | 3760 | 1.1% |
| OTHER TRANSIT | 2884 | 0.8% |
| NOT IN USE | 2416 | 0.7% |
| CTA | 2310 | 0.6% |
| POLICE | 2183 | 0.6% |
| Other values (15) | 7784 | 2.2% |
Length
| Value | Count | Frequency (%) |
| personal | 269825 | |
| unknown/na | 39358 | 9.8% |
| other | 13956 | 3.5% |
| taxi/for | 10490 | 2.6% |
| hire | 10490 | 2.6% |
| 7016 | 1.7% | |
| commercial | 6988 | 1.7% |
| single | 5177 | 1.3% |
| unit | 5177 | 1.3% |
| rideshare | 3760 | 0.9% |
| Other values (28) | 29551 | 7.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 458887 | |
| O | 353147 | |
| A | 342746 | |
| E | 331864 | |
| R | 330104 | |
| S | 294690 | |
| L | 288237 | |
| P | 272216 | |
| I | 62365 | 2.0% |
| U | 55408 | 1.8% |
| Other values (16) | 355669 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3145333 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 458887 | |
| O | 353147 | |
| A | 342746 | |
| E | 331864 | |
| R | 330104 | |
| S | 294690 | |
| L | 288237 | |
| P | 272216 | |
| I | 62365 | 2.0% |
| U | 55408 | 1.8% |
| Other values (16) | 355669 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3145333 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 458887 | |
| O | 353147 | |
| A | 342746 | |
| E | 331864 | |
| R | 330104 | |
| S | 294690 | |
| L | 288237 | |
| P | 272216 | |
| I | 62365 | 2.0% |
| U | 55408 | 1.8% |
| Other values (16) | 355669 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3145333 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 458887 | |
| O | 353147 | |
| A | 342746 | |
| E | 331864 | |
| R | 330104 | |
| S | 294690 | |
| L | 288237 | |
| P | 272216 | |
| I | 62365 | 2.0% |
| U | 55408 | 1.8% |
| Other values (16) | 355669 |
TRAVEL_DIRECTION
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.6 MiB |
| N | |
|---|---|
| S | |
| W | |
| E | |
| UNKNOWN | 8439 |
| Other values (4) |
Length
| Max length | 7 |
|---|---|
| Median length | 1 |
| Mean length | 1.192073 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S |
|---|---|
| 2nd row | E |
| 3rd row | N |
| 4th row | S |
| 5th row | E |
Common Values
| Value | Count | Frequency (%) |
| N | 88375 | |
| S | 85431 | |
| W | 79472 | |
| E | 77536 | |
| UNKNOWN | 8439 | 2.4% |
| SE | 5314 | 1.5% |
| NW | 4708 | 1.3% |
| SW | 3999 | 1.1% |
| NE | 3960 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| n | 88375 | |
| s | 85431 | |
| w | 79472 | |
| e | 77536 | |
| unknown | 8439 | 2.4% |
| se | 5314 | 1.5% |
| nw | 4708 | 1.3% |
| sw | 3999 | 1.1% |
| ne | 3960 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 122360 | |
| W | 96618 | |
| S | 94744 | |
| E | 86810 | |
| U | 8439 | 2.0% |
| K | 8439 | 2.0% |
| O | 8439 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 425849 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 122360 | |
| W | 96618 | |
| S | 94744 | |
| E | 86810 | |
| U | 8439 | 2.0% |
| K | 8439 | 2.0% |
| O | 8439 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 425849 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 122360 | |
| W | 96618 | |
| S | 94744 | |
| E | 86810 | |
| U | 8439 | 2.0% |
| K | 8439 | 2.0% |
| O | 8439 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 425849 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 122360 | |
| W | 96618 | |
| S | 94744 | |
| E | 86810 | |
| U | 8439 | 2.0% |
| K | 8439 | 2.0% |
| O | 8439 | 2.0% |
MANEUVER
Categorical
High correlation 
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 27.0 MiB |
| STRAIGHT AHEAD | |
|---|---|
| SLOW/STOP IN TRAFFIC | |
| TURNING LEFT | |
| BACKING | 18299 |
| TURNING RIGHT | 14060 |
| Other values (22) |
Length
| Max length | 34 |
|---|---|
| Median length | 14 |
| Mean length | 14.39171 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | STRAIGHT AHEAD |
|---|---|
| 2nd row | STRAIGHT AHEAD |
| 3rd row | STRAIGHT AHEAD |
| 4th row | STRAIGHT AHEAD |
| 5th row | CHANGING LANES |
Common Values
| Value | Count | Frequency (%) |
| STRAIGHT AHEAD | 193521 | |
| SLOW/STOP IN TRAFFIC | 37310 | 10.4% |
| TURNING LEFT | 24993 | 7.0% |
| BACKING | 18299 | 5.1% |
| TURNING RIGHT | 14060 | 3.9% |
| UNKNOWN/NA | 11352 | 3.2% |
| PASSING/OVERTAKING | 9238 | 2.6% |
| CHANGING LANES | 8983 | 2.5% |
| OTHER | 6741 | 1.9% |
| ENTERING TRAFFIC LANE FROM PARKING | 5383 | 1.5% |
| Other values (17) | 27354 | 7.7% |
Length
| Value | Count | Frequency (%) |
| straight | 193521 | |
| ahead | 193521 | |
| traffic | 48255 | 6.6% |
| slow/stop | 42568 | 5.8% |
| in | 40484 | 5.5% |
| turning | 39196 | 5.3% |
| left | 27467 | 3.7% |
| backing | 18299 | 2.5% |
| right | 15576 | 2.1% |
| unknown/na | 11352 | 1.5% |
| Other values (34) | 105080 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 728445 | |
| T | 602618 | |
| H | 420328 | |
| I | 415567 | |
| 378085 | 7.4% | |
| R | 356877 | 6.9% |
| G | 331248 | 6.4% |
| S | 318640 | 6.2% |
| E | 289309 | 5.6% |
| N | 277762 | 5.4% |
| Other values (16) | 1022329 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5141208 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 728445 | |
| T | 602618 | |
| H | 420328 | |
| I | 415567 | |
| 378085 | 7.4% | |
| R | 356877 | 6.9% |
| G | 331248 | 6.4% |
| S | 318640 | 6.2% |
| E | 289309 | 5.6% |
| N | 277762 | 5.4% |
| Other values (16) | 1022329 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5141208 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 728445 | |
| T | 602618 | |
| H | 420328 | |
| I | 415567 | |
| 378085 | 7.4% | |
| R | 356877 | 6.9% |
| G | 331248 | 6.4% |
| S | 318640 | 6.2% |
| E | 289309 | 5.6% |
| N | 277762 | 5.4% |
| Other values (16) | 1022329 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5141208 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 728445 | |
| T | 602618 | |
| H | 420328 | |
| I | 415567 | |
| 378085 | 7.4% | |
| R | 356877 | 6.9% |
| G | 331248 | 6.4% |
| S | 318640 | 6.2% |
| E | 289309 | 5.6% |
| N | 277762 | 5.4% |
| Other values (16) | 1022329 |
OCCUPANT_CNT
Real number (ℝ)
| Distinct | 39 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2815661 |
| Minimum | 0 |
|---|---|
| Maximum | 60 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 60 |
| Range | 60 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.7764914 |
|---|---|
| Coefficient of variation (CV) | 0.60589257 |
| Kurtosis | 351.00878 |
| Mean | 1.2815661 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.10007 |
| Sum | 457819 |
| Variance | 0.60293889 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 290494 | |
| 2 | 46154 | 12.9% |
| 3 | 12849 | 3.6% |
| 4 | 5118 | 1.4% |
| 5 | 1707 | 0.5% |
| 6 | 461 | 0.1% |
| 7 | 184 | 0.1% |
| 8 | 76 | < 0.1% |
| 9 | 39 | < 0.1% |
| 11 | 26 | < 0.1% |
| Other values (29) | 126 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 8 | < 0.1% |
| 1 | 290494 | |
| 2 | 46154 | 12.9% |
| 3 | 12849 | 3.6% |
| 4 | 5118 | 1.4% |
| 5 | 1707 | 0.5% |
| 6 | 461 | 0.1% |
| 7 | 184 | 0.1% |
| 8 | 76 | < 0.1% |
| 9 | 39 | < 0.1% |
| Value | Count | Frequency (%) |
| 60 | 1 | |
| 44 | 1 | |
| 43 | 1 | |
| 41 | 1 | |
| 39 | 2 | |
| 37 | 1 | |
| 36 | 2 | |
| 35 | 2 | |
| 34 | 1 | |
| 33 | 2 |
FIRST_CONTACT_POINT
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.8 MiB |
| FRONT | |
|---|---|
| REAR | |
| FRONT-RIGHT | |
| FRONT-LEFT | |
| SIDE-RIGHT | |
| Other values (9) |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 7.7099492 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FRONT-RIGHT |
|---|---|
| 2nd row | FRONT-LEFT |
| 3rd row | SIDE-LEFT |
| 4th row | SIDE-LEFT |
| 5th row | SIDE-RIGHT |
Common Values
| Value | Count | Frequency (%) |
| FRONT | 80274 | |
| REAR | 60846 | |
| FRONT-RIGHT | 52156 | |
| FRONT-LEFT | 49758 | |
| SIDE-RIGHT | 26221 | 7.3% |
| SIDE-LEFT | 23064 | 6.5% |
| REAR-LEFT | 22898 | 6.4% |
| REAR-RIGHT | 21100 | 5.9% |
| UNKNOWN | 12276 | 3.4% |
| NONE | 4032 | 1.1% |
| Other values (4) | 4609 | 1.3% |
Length
| Value | Count | Frequency (%) |
| front | 80274 | |
| rear | 60846 | |
| front-right | 52156 | |
| front-left | 49758 | |
| side-right | 26221 | 7.3% |
| side-left | 23064 | 6.4% |
| rear-left | 22898 | 6.3% |
| rear-right | 21100 | 5.8% |
| unknown | 12276 | 3.4% |
| none | 4032 | 1.1% |
| Other values (7) | 8740 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 497312 | |
| T | 382469 | |
| F | 278486 | |
| E | 258587 | |
| N | 227755 | |
| O | 203008 | |
| - | 195197 | 7.1% |
| I | 149437 | 5.4% |
| A | 113106 | 4.1% |
| H | 101105 | 3.7% |
| Other values (11) | 347794 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2754256 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 497312 | |
| T | 382469 | |
| F | 278486 | |
| E | 258587 | |
| N | 227755 | |
| O | 203008 | |
| - | 195197 | 7.1% |
| I | 149437 | 5.4% |
| A | 113106 | 4.1% |
| H | 101105 | 3.7% |
| Other values (11) | 347794 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2754256 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 497312 | |
| T | 382469 | |
| F | 278486 | |
| E | 258587 | |
| N | 227755 | |
| O | 203008 | |
| - | 195197 | 7.1% |
| I | 149437 | 5.4% |
| A | 113106 | 4.1% |
| H | 101105 | 3.7% |
| Other values (11) | 347794 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2754256 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 497312 | |
| T | 382469 | |
| F | 278486 | |
| E | 258587 | |
| N | 227755 | |
| O | 203008 | |
| - | 195197 | 7.1% |
| I | 149437 | 5.4% |
| A | 113106 | 4.1% |
| H | 101105 | 3.7% |
| Other values (11) | 347794 |
Anno
Real number (ℝ)
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2017.2293 |
| Minimum | 2014 |
|---|---|
| Maximum | 2019 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 2014 |
|---|---|
| 5-th percentile | 2016 |
| Q1 | 2017 |
| median | 2017 |
| Q3 | 2018 |
| 95-th percentile | 2018 |
| Maximum | 2019 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.87329284 |
|---|---|
| Coefficient of variation (CV) | 0.00043291699 |
| Kurtosis | -0.27791011 |
| Mean | 2017.2293 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.71394876 |
| Sum | 7.206229 × 108 |
| Variance | 0.76264038 |
| Monotonicity | Decreasing |
| Value | Count | Frequency (%) |
| 2018 | 162373 | |
| 2017 | 117300 | |
| 2016 | 60477 | 16.9% |
| 2015 | 13526 | 3.8% |
| 2019 | 3550 | 1.0% |
| 2014 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 2014 | 8 | < 0.1% |
| 2015 | 13526 | 3.8% |
| 2016 | 60477 | 16.9% |
| 2017 | 117300 | |
| 2018 | 162373 | |
| 2019 | 3550 | 1.0% |
| Value | Count | Frequency (%) |
| 2019 | 3550 | 1.0% |
| 2018 | 162373 | |
| 2017 | 117300 | |
| 2016 | 60477 | 16.9% |
| 2015 | 13526 | 3.8% |
| 2014 | 8 | < 0.1% |
Mese
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.0975327 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.4564858 |
|---|---|
| Coefficient of variation (CV) | 0.48699822 |
| Kurtosis | -1.1541951 |
| Mean | 7.0975327 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.24754956 |
| Sum | 2535480 |
| Variance | 11.947294 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 38676 | |
| 12 | 37932 | |
| 11 | 36325 | |
| 9 | 34367 | |
| 8 | 31415 | |
| 7 | 27762 | |
| 5 | 27481 | |
| 6 | 27218 | |
| 1 | 26570 | |
| 4 | 24371 | |
| Other values (2) | 45117 |
| Value | Count | Frequency (%) |
| 1 | 26570 | |
| 2 | 21114 | |
| 3 | 24003 | |
| 4 | 24371 | |
| 5 | 27481 | |
| 6 | 27218 | |
| 7 | 27762 | |
| 8 | 31415 | |
| 9 | 34367 | |
| 10 | 38676 |
| Value | Count | Frequency (%) |
| 12 | 37932 | |
| 11 | 36325 | |
| 10 | 38676 | |
| 9 | 34367 | |
| 8 | 31415 | |
| 7 | 27762 | |
| 6 | 27218 | |
| 5 | 27481 | |
| 4 | 24371 | |
| 3 | 24003 |
Giorno
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.527167 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 15 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.80583 |
|---|---|
| Coefficient of variation (CV) | 0.56712406 |
| Kurtosis | -1.1801433 |
| Mean | 15.527167 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.042023058 |
| Sum | 5546832 |
| Variance | 77.542643 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 12457 | 3.5% |
| 1 | 12342 | 3.5% |
| 5 | 12339 | 3.5% |
| 13 | 12319 | 3.4% |
| 2 | 12168 | 3.4% |
| 14 | 12167 | 3.4% |
| 17 | 12158 | 3.4% |
| 3 | 12085 | 3.4% |
| 6 | 11962 | 3.3% |
| 16 | 11896 | 3.3% |
| Other values (21) | 235341 |
| Value | Count | Frequency (%) |
| 1 | 12342 | |
| 2 | 12168 | |
| 3 | 12085 | |
| 4 | 11877 | |
| 5 | 12339 | |
| 6 | 11962 | |
| 7 | 11863 | |
| 8 | 11500 | |
| 9 | 11659 | |
| 10 | 12457 |
| Value | Count | Frequency (%) |
| 31 | 7237 | |
| 30 | 10746 | |
| 29 | 10825 | |
| 28 | 11119 | |
| 27 | 11143 | |
| 26 | 11182 | |
| 25 | 10648 | |
| 24 | 10822 | |
| 23 | 11770 | |
| 22 | 11186 |
Interactions
Correlations
| Anno | CRASH_UNIT_ID | FIRST_CONTACT_POINT | Giorno | MANEUVER | Mese | OCCUPANT_CNT | TRAVEL_DIRECTION | UNIT_NO | UNIT_TYPE | VEHICLE_DEFECT | VEHICLE_ID | VEHICLE_TYPE | VEHICLE_USE | VEHICLE_YEAR | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Anno | 1.000 | 0.930 | 0.024 | -0.038 | 0.024 | -0.203 | 0.024 | 0.014 | -0.012 | 0.006 | 0.014 | 0.930 | 0.026 | 0.043 | 0.120 |
| CRASH_UNIT_ID | 0.930 | 1.000 | 0.023 | -0.002 | 0.024 | 0.144 | 0.025 | 0.014 | -0.011 | 0.006 | 0.013 | 1.000 | 0.026 | 0.035 | 0.126 |
| FIRST_CONTACT_POINT | 0.024 | 0.023 | 1.000 | 0.003 | 0.168 | 0.008 | 0.007 | 0.053 | 0.163 | 0.072 | 0.060 | 0.023 | 0.087 | 0.092 | 0.025 |
| Giorno | -0.038 | -0.002 | 0.003 | 1.000 | 0.004 | 0.027 | -0.002 | 0.004 | -0.000 | 0.000 | 0.003 | -0.002 | 0.004 | 0.003 | -0.001 |
| MANEUVER | 0.024 | 0.024 | 0.168 | 0.004 | 1.000 | 0.014 | 0.007 | 0.128 | 0.155 | 0.590 | 0.055 | 0.024 | 0.061 | 0.080 | 0.024 |
| Mese | -0.203 | 0.144 | 0.008 | 0.027 | 0.014 | 1.000 | 0.002 | 0.008 | 0.001 | 0.003 | 0.004 | 0.144 | 0.014 | 0.010 | 0.014 |
| OCCUPANT_CNT | 0.024 | 0.025 | 0.007 | -0.002 | 0.007 | 0.002 | 1.000 | 0.000 | 0.095 | 0.000 | 0.016 | 0.025 | 0.064 | 0.074 | 0.015 |
| TRAVEL_DIRECTION | 0.014 | 0.014 | 0.053 | 0.004 | 0.128 | 0.008 | 0.000 | 1.000 | 0.022 | 0.029 | 0.026 | 0.014 | 0.033 | 0.033 | 0.021 |
| UNIT_NO | -0.012 | -0.011 | 0.163 | -0.000 | 0.155 | 0.001 | 0.095 | 0.022 | 1.000 | 0.050 | 0.081 | -0.011 | 0.044 | 0.083 | 0.121 |
| UNIT_TYPE | 0.006 | 0.006 | 0.072 | 0.000 | 0.590 | 0.003 | 0.000 | 0.029 | 0.050 | 1.000 | 0.015 | 0.006 | 0.016 | 0.137 | 0.000 |
| VEHICLE_DEFECT | 0.014 | 0.013 | 0.060 | 0.003 | 0.055 | 0.004 | 0.016 | 0.026 | 0.081 | 0.015 | 1.000 | 0.013 | 0.060 | 0.108 | 0.020 |
| VEHICLE_ID | 0.930 | 1.000 | 0.023 | -0.002 | 0.024 | 0.144 | 0.025 | 0.014 | -0.011 | 0.006 | 0.013 | 1.000 | 0.025 | 0.035 | 0.126 |
| VEHICLE_TYPE | 0.026 | 0.026 | 0.087 | 0.004 | 0.061 | 0.014 | 0.064 | 0.033 | 0.044 | 0.016 | 0.060 | 0.025 | 1.000 | 0.326 | 0.042 |
| VEHICLE_USE | 0.043 | 0.035 | 0.092 | 0.003 | 0.080 | 0.010 | 0.074 | 0.033 | 0.083 | 0.137 | 0.108 | 0.035 | 0.326 | 1.000 | 0.022 |
| VEHICLE_YEAR | 0.120 | 0.126 | 0.025 | -0.001 | 0.024 | 0.014 | 0.015 | 0.021 | 0.121 | 0.000 | 0.020 | 0.126 | 0.042 | 0.022 | 1.000 |
Missing values
Sample
| CRASH_UNIT_ID | RD_NO | CRASH_DATE | UNIT_NO | UNIT_TYPE | VEHICLE_ID | MAKE | MODEL | LIC_PLATE_STATE | VEHICLE_YEAR | VEHICLE_DEFECT | VEHICLE_TYPE | VEHICLE_USE | TRAVEL_DIRECTION | MANEUVER | OCCUPANT_CNT | FIRST_CONTACT_POINT | Anno | Mese | Giorno | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 561563 | JC113627 | 2019-01-11 23:36:00 | 2 | DRIVER | 535738.0 | TOYOTA MOTOR COMPANY, LTD. | Highlander(beginning vehicle year 2001) | IL | 2003.0 | NONE | SPORT UTILITY VEHICLE (SUV) | PERSONAL | S | STRAIGHT AHEAD | 1.0 | FRONT-RIGHT | 2019 | 1 | 11 |
| 2 | 561564 | JC113627 | 2019-01-11 23:36:00 | 1 | DRIVER | 535741.0 | FORD | EXPLORER | IL | 2001.0 | NONE | SPORT UTILITY VEHICLE (SUV) | PERSONAL | E | STRAIGHT AHEAD | 1.0 | FRONT-LEFT | 2019 | 1 | 11 |
| 3 | 561540 | JC113637 | 2019-01-11 23:31:00 | 1 | DRIVER | 535714.0 | CHEVROLET | MALIBU (CHEVELLE) | IL | 2013.0 | NONE | PASSENGER | PERSONAL | N | STRAIGHT AHEAD | 1.0 | SIDE-LEFT | 2019 | 1 | 11 |
| 4 | 561541 | JC113637 | 2019-01-11 23:31:00 | 2 | DRIVER | 535718.0 | JEEP | LAREDO | IL | 2016.0 | NONE | PASSENGER | PERSONAL | S | STRAIGHT AHEAD | 1.0 | SIDE-LEFT | 2019 | 1 | 11 |
| 5 | 561542 | JC113630 | 2019-01-11 23:22:00 | 1 | DRIVER | 535717.0 | JEEP | Liberty | IL | 2015.0 | NONE | PASSENGER | PERSONAL | E | CHANGING LANES | 1.0 | SIDE-RIGHT | 2019 | 1 | 11 |
| 7 | 561528 | JC113604 | 2019-01-11 23:08:00 | 2 | DRIVER | 535706.0 | TOYOTA MOTOR COMPANY, LTD. | CAMRY | IL | 2007.0 | UNKNOWN | PASSENGER | TAXI/FOR HIRE | W | STRAIGHT AHEAD | 1.0 | REAR | 2019 | 1 | 11 |
| 9 | 561514 | JC113579 | 2019-01-11 22:45:00 | 2 | DRIVER | 535694.0 | PONTIAC | BONNEVILLE | IL | 2002.0 | NONE | PASSENGER | PERSONAL | S | AVOIDING VEHICLES/OBJECTS | 2.0 | REAR | 2019 | 1 | 11 |
| 10 | 561546 | JC113617 | 2019-01-11 22:28:00 | 1 | DRIVER | 535723.0 | BUICK | REGAL | IL | 2011.0 | UNKNOWN | PASSENGER | PERSONAL | N | STRAIGHT AHEAD | 1.0 | FRONT | 2019 | 1 | 11 |
| 11 | 561547 | JC113617 | 2019-01-11 22:28:00 | 2 | DRIVER | 535725.0 | CHEVROLET | UNKNOWN | IL | 2010.0 | UNKNOWN | PASSENGER | PERSONAL | N | STRAIGHT AHEAD | 2.0 | REAR | 2019 | 1 | 11 |
| 13 | 561516 | JC113568 | 2019-01-11 22:16:00 | 2 | DRIVER | 535695.0 | DODGE | DART | IL | 2014.0 | UNKNOWN | PASSENGER | PERSONAL | S | STRAIGHT AHEAD | 1.0 | FRONT-RIGHT | 2019 | 1 | 11 |
| CRASH_UNIT_ID | RD_NO | CRASH_DATE | UNIT_NO | UNIT_TYPE | VEHICLE_ID | MAKE | MODEL | LIC_PLATE_STATE | VEHICLE_YEAR | VEHICLE_DEFECT | VEHICLE_TYPE | VEHICLE_USE | TRAVEL_DIRECTION | MANEUVER | OCCUPANT_CNT | FIRST_CONTACT_POINT | Anno | Mese | Giorno | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 460426 | 560784 | JC111992 | 2015-01-10 17:15:00 | 1 | DRIVER | 534993.0 | FORD | UNKNOWN | IL | 2012.0 | UNKNOWN | UNKNOWN/NA | UNKNOWN/NA | UNKNOWN | STRAIGHT AHEAD | 1.0 | FRONT | 2015 | 1 | 10 |
| 460427 | 560785 | JC111992 | 2015-01-10 17:15:00 | 2 | DRIVER | 535002.0 | CHEVROLET | MALIBU (CHEVELLE) | IL | 2009.0 | NONE | PASSENGER | PERSONAL | UNKNOWN | STRAIGHT AHEAD | 1.0 | REAR | 2015 | 1 | 10 |
| 460429 | 70308 | HZ400518 | 2014-08-20 16:50:00 | 1 | DRIVER | 67994.0 | PONTIAC | UNKNOWN | IL | 2002.0 | NONE | PASSENGER | PERSONAL | E | SLOW/STOP IN TRAFFIC | 1.0 | FRONT | 2014 | 8 | 20 |
| 460430 | 70309 | HZ400518 | 2014-08-20 16:50:00 | 2 | DRIVER | 67999.0 | KIA MOTORS CORP | Rio | IL | 2002.0 | NONE | PASSENGER | PERSONAL | E | SLOW/STOP IN TRAFFIC | 1.0 | REAR | 2014 | 8 | 20 |
| 460431 | 30750 | HZ164689 | 2014-02-24 19:45:00 | 1 | DRIVER | 29699.0 | VOLVO | UNKNOWN | IL | 2004.0 | NONE | PASSENGER | PERSONAL | S | STRAIGHT AHEAD | 1.0 | FRONT | 2014 | 2 | 24 |
| 460432 | 30751 | HZ164689 | 2014-02-24 19:45:00 | 2 | DRIVER | 29701.0 | CHEVROLET | UNKNOWN | TN | 2016.0 | NONE | PASSENGER | PERSONAL | S | TURNING LEFT | 1.0 | REAR | 2014 | 2 | 24 |
| 460433 | 24495 | HZ122950 | 2014-01-21 07:40:00 | 1 | DRIVER | 23633.0 | TOYOTA MOTOR COMPANY, LTD. | COROLLA | IL | 2005.0 | NONE | PASSENGER | NOT IN USE | S | STRAIGHT AHEAD | 1.0 | SIDE-LEFT | 2014 | 1 | 21 |
| 460434 | 24496 | HZ122950 | 2014-01-21 07:40:00 | 2 | DRIVER | 23634.0 | NISSAN | ROGUE | IL | 2013.0 | NONE | PASSENGER | PERSONAL | W | STRAIGHT AHEAD | 1.0 | FRONT | 2014 | 1 | 21 |
| 460435 | 481321 | JB442550 | 2014-01-18 18:14:00 | 1 | DRIVER | 460655.0 | MERCEDES-BENZ | UNKNOWN | IL | 2016.0 | UNKNOWN | PASSENGER | UNKNOWN/NA | E | LEAVING TRAFFIC LANE TO PARK | 1.0 | FRONT-RIGHT | 2014 | 1 | 18 |
| 460436 | 481322 | JB442550 | 2014-01-18 18:14:00 | 2 | PARKED | 460661.0 | DODGE | CHARGER | IL | 2018.0 | NONE | PASSENGER | PERSONAL | E | PARKED | 1.0 | FRONT-LEFT | 2014 | 1 | 18 |